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Let S n = ^X n X^ where X n = {Xij} is a p x n matrix with 
i.i.d. complex standardized entries having finite fourth moments. Let 
F n (ti,t2,cr) = v / p(x„(ti)*(S„ + o7)~ 1 x n (t 2 ) -x„(ti)*x n (t 2 )ra„(<7)) 
in which a > and m n (a) = f dF ^j T where F Vn (x) is the Marcenko- 
Pastur law with parameter y„ = p/n; which converges to a posi- 
tive constant as n-> oo, and x n (ti) and x n (t2) are unit vectors 
in C p , having indices ti and t 2 , ranging in a compact subset of a 
finite-dimensional Euclidean space. In this paper, we prove that the 
sequence Y n (ti,t2,o) converges weakly to a (2m + 1) -dimensional 
Gaussian process. This result provides further evidence in support 
of the conjecture that the distribution of the eigenmatrix of S n is 
asymptotically close to that of a Haar-distributed unitary matrix. 

1. Introduction. Suppose that {xjk,j,k = 1,2,...} is a double array of 
complex random variables that are independent and identically distributed 
(i.i.d.) with mean zero and variance 1. Let Xj = (x±j, . . . ,x p j)' and X = 
(xi , . . . , x„) , we define 

n 

(1.1) S„ = - Vx fc x* fc = -XX*, 

n n 
k=l 

where x£ and X* are the transposes of the complex conjugates of x^ and 
X, respectively. The matrix S n defined in (1.1) can be viewed as the sample 
covariance matrix of a p-dimensional random sample with size n. When the 
dimension p is fixed and the sample size n is large, the spectral behavior of S n 
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has been extensively investigated in the literature due to its importance in 
multivariate statistical inference [see, e.g., Anderson (1951, 1989)]. However, 
when the dimension p is proportional to the sample size n in the limit; that 
is, ^ — y y > as n — > oo, the classical asymptotic theory will induce serious 
inaccuracy. This phenomenon can be easily explained from the viewpoint of 
random matrix theory (RMT). 

Before introducing our advancement of the theory, we will first give a 
brief review of some well-known properties of S n in RMT. We define the 
empirical spectral distribution (ESD) of S n by 

1 p 

F s "(x) = -^2l(\ J <x), 
i=i 

where Aj's are eigenvalues of S n . First, it has long been known that F Sn (x) 
converges almost surely to the standard Marcenko-Pastur law [MPL; see, 
e.g., Marcenko and Pastur (1967), Wachter (1978) and Yin (1986)] F y (x), 
which has a density function (2ttxu)~ 1 y (b — x)(x — a), supported on [a, b] = 
[(1 — y/y) 2 , (1 + ^/y) 2 ]- For the case y > 1, F y (x) has a point mass 1 — 1/y 
at 0. If its fourth moment is finite, as n—>oo, the largest eigenvalue of S n 
converges to b while the smallest eigenvalue (when y < 1) or the (p — n + l)st 
smallest eigenvalue (when y > 1) converges to a [see Bai (1999) for a review]. 
The central limit theorem (CLT) for linear spectral statistics (LSS) of S n 
has been established in Bai and Silverstein (2004). 

While results on the eigenvalues of S n are abundant in the literature, not 
much work has been done on the behavior of the eigenvectors of S n . It has 
been conjectured that the eigenmatrix; that is, the matrix of orthonormal 
eigenvectors of S n , is asymptotically Haar-distributed. This conjecture has 
yet to be formally proven due to the difficulty of describing the "asymp- 
totically Haar-distributed" properties when the dimension p increases to 
infinity. Silverstein (1981) was the first one to create an approach to char- 
acterize the eigenvector properties. We describe his approach as follows: 
denoting the spectral decomposition of S n by U*AU„, if Xjj is normally 
distributed, U n has a Haar measure on the orthogonal matrices and is in- 
dependent of the eigenvalues in A. For any unit vector x n 6 P, the vector 
Yn = (yi, ■ ■ ■ , Up) = U n x n performs like a uniform distribution over the unit 
sphere in C p . As such, for t G [0, 1], a stochastic process 

X n (t) = VP/2 5>?-1/p) 
\i=i 
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is defined. If z = (z±, . . . , z p )' ~ iV(0, I p ), then y n has the same distribution 
as z/||z|| and X n (t) is identically distributed with 



Applying Donsker's theorem [Donsker (1951)], X n (t) tends to a standard 
Brownian bridge. 

For any general large sample covariance, it is important to examine the 
behavior of the X n (t) process. Silverstein (1981, 1984, 1989) prove that the 
integral of polynomial functions with respect to X n (t) will tend to a normal 
distribution. To overcome the difficulty of tightness, Silverstein (1990) takes 
x n = (±1, . . . , ±l)/^/p so that the process X n (t) will tend to the standard 
Brownian bridge instead. In addition, Bai, Miao and Pan (2007) investigate 
the process X n (t), defined for Tn S n T n ^ ! with (T n /2 ) 2 = T n , a nonnegative 
positive definite matrix. 

However, so far, the process X n (t) is assumed to be generated only by one 
unit vector x n , in C p . This imposes restrictions on many practical situations. 
For example, in the derivation of the limiting properties of the bootstrap 
corrected Markowitz portfolio estimates, we need to consider two unparallel 
vectors simultaneously [see Bai, Liu and Wong (2009) and Markowitz (1952, 
1959, 1991)]. In this paper, we will go beyond the boundaries of their studies 
to investigate the asymptotics of the eigenmatrix for any general large sam- 
ple covariance matrix S n when x n runs over a subset of the p-dimensional 
unit sphere in which = {x n : ||x n 

We describe the approach we introduced in this paper as follows: if V n 
is Haar-distributed, for any pair of p-vectors x and y satisfying x _L y, 
(V n x, V n y) possesses the same joint distribution as 



where z\ and Z2 are two independent p-vectors whose components are i.i.d. 
standard normal variables. As n tends to infinity, we have 



Therefore, any group of functionals defined by these two random vectors 
should be asymptotically independent of each other. We shall adopt this 
setup to explore the conjecture that U n is asymptotically Haar-distributed. 

We consider x and y to be two p- vectors with an angle 6. Thereafter, we 
find two orthonormal vectors ol\ and a.2 such that 




(1.2) 




(1.3) 




x=||x||ai and y = ||y||(ai cos6* + a.2 sin#). 



4 Z. D. BAI, H. X. LIU AND W. K. WONG 

By (1.2) and (1.3), we have 

(1.4) V„x ~p _1 / 2 ||x||zi and V„y ~ p _1 ' 2 ||y||(zx cos# + z 2 sin#). 

Let a > be a positive constant, we now consider the following three quan- 
tities: 

(1.5) x*(S n + (j/)- 1 x, x*(S n , + a/)- 1 y and y*(S n + oT^y- 

We hypothesize that if U n is asymptotically Haar-distributed and is 
asymptotically independent of A, then the above three quantities should 
be asymptotically equivalent to 

(1.6) p _1 ||x|| ||y||z^(A + cr/)~ 1 (zi COS6* + z 2 sin0) and 

P~ 1 1 1 y 1 1 2 ( cos # z i + sin # z 2 ) * ( A + a I) _ 1 (zi cos 9 + z 2 sin 9) , 

respectively. We then proceed to investigate the stochastic processes related 
to these functionals. By using the Stieltjes transform of the sample covari- 
ance matrix, we have 



-i*fA_L n-i _^ (\ l + v-y-^jl+y + a) 2 -^ 

p ZifA + crJ) zi — >■ m(<7) = a.s., 

2ya 

where m(a) is a solution to the quadratic equation 

(1.7) m(l + a — y + yam) — 1 = 0. 

Here, the selection of m(cr) is due to the fact that m(a) — > as a — > oo. By 
using the same argument, we conclude that 

p~ 1 (cos9zi + sin#z 2 )*(A + <t/)~ 1 (zi cos 9 + z 2 sin#) — >• m(a) a.s. 

Applying the results in Bai, Miao and Pan (2007), it can be easily shown 
that, for the complex case, 

(1.8) P' 1/2 [zl(A + a/rV - pm n (a)] — > JV(0, W), 

and for the real case, the limiting variance is 2W, where W = W(cr, a), 
m n (a) is m{a) with y replaced by y n such that 



and 



/ \ l + a-y n - yj{l + y n + a) 2 - Ay n 
2y„o- 

y n = p/n 



W f ai a \ = m{ai)m{a 2 ) 

l-y(l-a 1 ym(ai))(l-a 2 m(a 2 ))' 
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Here, the definitions of "real case" and "complex case" are given in Theorem 
1 as stated in the next section. By the same argument, one could obtain a 
similar result such that 

(1.9) p" 1/2 [(zi cosfl + z 2 sin6>)*(A + aiy 1 (z 1 cos6> + z 2 sin (9) - pm n {a)]. 

We normalize the second term in (1.6) and, thereafter, derive the CLT for 
the joint distribution of all three terms stated in (1.6) after normalization. 
More notably, we establish some limiting behaviors of the processes defined 
by these normalized quantities. 

2. Main results. Let § = § p be a subset of the unit p-sphere indexed 
by an m-dimensional hyper-cube T = [0,2-/r] m . For any m arbitrarily chosen 
orthogonal unit p- vectors Xi, . . . ,x m+ i £ Cf, we define 

8 = {x„(t) = xi costi + X2 sinti cost 2 + • • • + x m sin£i • • • sini m _i cost m 
(2.1) +x m+ isinti • • -sint m _isint m ,t G T}. 

If § is chosen in the form of (2.1), then the inner product x n (ti)*x n (t2) 
is a function of ti and t 2 only (i.e., independent of n). Also, the norm of 
the difference (we call it norm difference in this paper) ||x n (ti) — x n (t2)|| 
satisfies the Lipschitz condition. If the time index set is chosen arbitrarily, we 
could assume that the angle, i? n (ti,t 2 ), between x n (ti) and x n (t 2 ) tends to a 
function of ti and t 2 whose norm difference satisfies the Lipschitz condition. 

Thereafter, we define a stochastic process Y n (u,cr) mapping from the 
time index set T x T x / to § with / = [a±o, 020] (0 < crio < C20) such that 

y n (u,cr) = v /p(x ri (ti)*(S n + cr/) _1 x n (t 2 ) - x n (ti)*x„(t 2 )m n (cr)), 

where (u,cr) = (ti,t 2 ,cr) G T X T X I. 

Remark 1. If the sample covariance matrix S n is real, the vectors x n 
and y n will be real, and thus, the set § has to be defined as a subset of unit 
sphere = {x G W, ||x|| = 1}. The time index can be similarly described 
for the complex case. In what follows, we shall implicitly use the convention 
for the real case. 

We have the following theorem. 

Theorem 1. Assume that the entries o/X are i.i.d. with mean 0, vari- 
ance 1, and finite fourth moments. If the variables are complex, we further 
assume EX^ = and E|A"n| 4 = 2, and refer to this case as the complex 
case. If the variables are real, we assume EX^ = 3 and refer to it as the 
real case. Then, as n — > 00, the process Y n (ti,t2,o~) converges weakly to 
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a multivariate Gaussian process Y(ti,t 2 ,o~) with mean zero and variance- 
covariance function EK(ti,t2,ci)l^(t3,t4,cT2) satisfying 

EY(t 1 ,t 2 ,o- 1 )Y(t 3 ,U,V2)=$(t 1 ,U)$(t 3 ,t 2 )W(a 1 ,o- 2 ) 

for the complex case and satisfying 

EF(ti, t 2 , a 1 )Y(t 3 , U, a 2 ) = (^(t x , t 4 )0(t 3 , t 2 ) + 0(ti, t 3 )d(U, t 2 ))W(<n , <r 2 ) 
for the real case where 

W{ai,a 2 ) = - — —- — -- 

1 -y(l - axm{a 1 ))(l - a 2 m(a 2 )) 

and 

i?(t,s) =limx*(t)x n (s). 

We will provide the proof of this theorem in the next section. We note 
that Bai, Miao and Pan (2007) have proved that 

^[x„(ti)*(S n + a/r^ti) - m n (a)) — )• N(0, W) 

for the complex case and proved that the asymptotic variance is 2W for the 
real case. 

More generally, if x and y are two orthonormal vectors, applying Theorem 
1, we obtain the limiting distribution of the three quantities stated in (1.5) 
with normalization such that 

(x*(S n + cr/) _1 x - m n (a 
x^ + a/rV 
y*(S n + aiy l y - m n (a) 

for the complex case while the asymptotic covariance matrix is 

'2W 

W 

2W 

for the real case. 




Remark 2. This theorem shows that the three quantities stated in (1.5) 
are asymptotically independent of one another. It provides a stronger sup- 
port to the conjecture that U n is asymptotically Haar-distributed than those 
established in the previous literature. 

In many practical applications, such as wireless communications and elec- 
trical engineering [see, e.g., Evans and Tse (2000)], we are interested in ex- 
tending the process Y n (u,a) defined on a region T x T x D where D is a 
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compact subset of the complex plane and is disjoint with the interval [a, b] , 
the support of the MPL. We can define a complex measure by putting com- 
plex mass x*(ti)U*ej-ejU n y(t2) at Xj, the jth eigenvalue of S n , where ej 
is the p-vector with 1 in its jth entry and otherwise. In this situation, the 
Stieltjes transform of this complex measure is 

s n (z) =x*(S n -zl) _1 y, 

where z = fjL + iv with v ^0. When considering the CLT of LSS associated 
with the complex measure defined above, we need to examine the limiting 
properties of the Stieltjes transforms, which lead to the extension of the 
process Y n (u, a) to Y n (u, —z), where z is an index number in D. 

If x*y is a constant (or has a limit, we still denote it as x*y for simplicity), 
it follows from Lemma 6 that 

x*(S n - 2l) -1 y — > x*ys(z), 

where 



- z - y + y^l - z + y) 2 - 4y 
s(z) = { 2yz 

s(z), when ^s(z) < 0, 



when > 0, 



l-z-y + sgn(5(z)) y 7 (1 - z + y) 2 - 4y 

— u^J^u, 

is the Stieltjes transform of MPL, in which, by convention, the square root 
y/z takes the one with the positive imaginary part. When z ^ is real, 
s(z) is defined as the limit from the upper complex plane. By definition, 
m(a) = s(—a + iO) = lim v ^Qs(— a + iv). In calculating the limit, we follow 
the conventional sign of the square root of a complex number that the real 
part of y (—a + iv — 1 — y) 2 — 4y should have the opposite sign of v, and 
thus 



1 + a-y- y/{l + y + a) 2 -4y 

m{a) = . 

2ya 

Now, we are ready to extend the process Y n (u, a) to 

Y n (u,z) = v /p[x^(ti)(S n - zl) _1 x n (t 2 ) -x*(ti)x n (t 2 )s(2;,yn)], 

where s(z,y n ) is the Stieltjes transform of the LSD of S n in which y is 
replaced by y n . Here, z = u + iv with v > or v < 0. Thereby, we obtain the 
following theorem. 

Theorem 2. Under the conditions of Theorem 1, the process Y n (u,z) 
tends to a multivariate Gaussian process Y(u, z) with mean and covariance 
function E(Y (u, z\)Y (u, Z2)) satisfying 

(2.3) E(Y(u,z 1 )Y{u,z 2 )) = ti(t 1 ,U)#(t3M)W(z 1 ,Z2) 
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for the complex case and satisfying 

(2.4) E(Y(u, Zl )Y(u, z 2 )) = (0(ti, t 4 )tf (t 3 , t 2 ) + 7?(ti, t 3 )0(t 4) t 2 )) W(zi, z 2 ) 
for the real case where 

ys{zi)s(z 2 ) 



W( Zl ,z 2 ) 



l-y{l + z 1 s{z 1 )){l + z 2 s{z 2 ))' 



Theorem 2 follows from Theorem 1 and Vitali lemma [see Lemma 2.3 
of Bai and Silverstein (2004)] since both Y(u,z) and Y n (u,z) are analytic 
functions when z is away from [a, b], the support of MPL. 

Suppose that f{x) is analytic on an open region containing the interval 
[a, b] . We construct an LSS with respect to the complex measure as defined 
earlier; that is, 

v 

^/(AjOx^tiJU^-e^yCta). 

3=1 

We then consider the normalized quantity 

X n (f) = Vp( ^/(A J )x*(t 1 )U;e j e;.U n y(t 2 ) 
\j=i 

(2.5) 

-x*(ti)y(t 2 ) J f(x)dF yn (x)\ 
where F y is the standardized MPL. By applying the Cauchy formula 

m=^-i^-dz, 

Ztti J c z — x 
where C is a contour enclosing x, we obtain 

X n (f,u) = (/[<(*l)(Sn " ^I)-V(t 2 ) 

(2.6) 

- X * n (t l )y(t 2 )s n (z)]f(z)dz), 



where C is a contour enclosing the interval [a, b), u = (ti,t 2 ), and 



/s I- z-y n + sgn(^(z))y/ (1- z + y n ) 2 - Ay n 

Sn{z) = 2y~z ' 

Thereafter, we obtain the following two corollaries. 
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Corollary 1. Under the conditions of Theorem 1, for any k functions 
/i, ■•■,/& analytic on an open region containing the interval [a,b], the k- 
dimensional process 



tends to the k- dimensional stochastic multivariate Gaussian process with 
mean zero and covariance function satisfying 



where 9 = #(ti,t4)#(t3,t2) for the complex case and = #(ti,t4)$(t3,t2) + 
t?'(ti,t3)'(9 / (t4,t2) for the real case. Here, C\ andC2 are two disjoint contours 
that enclose the interval [a, b] such that the functions fi,-.-,fk are analytic 
inside and on them. 

Corollary 2. The covariance function in Corollary 1 can also be writ- 
ten as 



where 6 has been defined in Corollary 1. 

3. The proof of Theorem 1. To prove Theorem 1, by Lemma 7, it is suf- 
ficient to show that Y n (u,a) — EY"„(u, a) tends to the limit process Y(u, a). 
We will first prove the property of the finite-dimensional convergence in Sec- 
tion 3.1 before proving the tightness property in Section 3.3. Throughout 
the paper, the limit is taken as n — > oo. 

3.1. Finite- dimensional convergence. Under the assumption of a finite 
fourth moment, we follow Bai, Miao and Pan (2007) to truncate the random 
variables Xij at e n tfri for all i and j in which e n — > before renormalizing the 
random variables to have mean and variance 1. Therefore, it is reasonable 
to impose an additional assumption that \Xu\ < e n \fn for all i and j. 

Suppose Sj denotes the jth column of ^=X n . Let A(<r) = S n + al and 

Aj(er) = A(cr) — SjS*. Let x n and y n be any two vectors in C^. We define 



(X n (/i,ui),...,X n (/ fe ,u fc )) 
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1 



1 



l + s*A-V)s. 

bj(a) = 

and 



1 + n _1 tr At 1 (a) 



b 



1 + n- 1 EtrA^ 1 (cj) ' 

We also define the a-field Tj = c(si, . . . , Sj). We denote by Ej(-) the condi- 
tional expectation when JFj is given. By convention, Eo denotes the uncon- 
ditional expectation. 

Using the martingale decomposition, we have 

n 

A- V) " EA- V) = £( Ej - E J _ 1 )[A _1 (cr) - A7 k \o)] 

= £(E,- - E i _ 1 )^A7 1 (a)s,s*A7l( ( r). 
i=i 

Therefore, 

n 

Y n (u,a) = ^p^(E j - Ej-iJxCtiJ'lA- 1 ^) - A~V)]x(t 2 ) 

i=i 

= v ^^(E j - E i _ 1 )/3 j x(t 1 )*A7 1 (a)s i s*AT 1 ( (T )x(t 2 ). 
i=i 

Consider the if-dimensional distribution of {Y n (ui,<7i), . . . , Y n (ux, &k)} 
where (uj,<7j) = (tji,tj2,0i) € T x T x I. Invoking Lemma 3, we will have 
K 

^c*(Y n fa,ai) - EY n (ui,ai))=> N(0,ct'Ea) 
i=i 

for any constants aj, i = 1, . . . ,K, where 

ol = (ai,...,a K )' 

and 

£„• = EY(tii,ti2,ai)Y(tji,t j2 ,(Tj) = l &{ti 1 ,tj 2 )'&(tj 1 ,ti2)W((Ti,cr j ) 
for the complex case and 

Ey = MW^Wtji.tia) +1?(t il ,t il )l?(t i 2,t i 2))W(<7 ij O- i ) 

for the real case. 

To this end, we will verify the Liapounov condition and calculate the 
asymptotic covariance matrix E (see Lemma 3) in the next subsections. 
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3.1.1. Verification of Liapounov's condition. By (3.1), we have 
K 

Y,a t (Y n (a i )-EY n (a i )) 



(3.2) 



i=l 



K 



= y/p^Ej -E J _ 1 )^(a i /3 J (x*(t ll )AT 1 ( CJl )s J s*A- 1 ( C 7 i )x(t i2 ))). 
j=i i=i 

The Liapounov condition with power index 4 follows by verifying that 



(3.3) p 2 ^E 

3=1 



A 



H2, 



i=l 

The limit (3.3) holds if one can prove that, for any x n ,y n 
(3.4) 



0. 



3=1 

To do this, applying Lemma 2.7 of Bai and Silverstein (1998), for any q>2, 
we get 

' maxE|s*A7V)y n x;;A- V)s,f = 0(n~ 1 -^ 2 ), 
E| 7i (a)| 9 = 0(n- 1 - g/2 ) and 



(3.5) 



max J 

i 

maxE|^(cj)| 9 = 0(n-' i '/ 2 ). 

V. 3 



When q > 2, the O(-) can be replaced by o(-) in the first two inequalities. 
The assertion in (3.4) will then easily follow from the estimations in (3.5) 
and the observation that |/3j(c)| < 1. 

3.1.2. Simplification of Y n (u) — EY n (u) . For any x n , y n £ C^, from (3.1), 
we have 

X ;A-V)y n - e x ;a~ V)y« 

n 

(3.6) =^(6E i 7i + E i (6 i -6) 7i 
i=i 

+ (E,- - E J _ 1 )6 J (a)/3 i ( C r)^(a)s* Aj\a)y nX * n Aj\a) Sj ). 
For the third term on the right-hand side of (3.6), applying (3.5), we have 



E 



v ^^(E J -E rl )6 J (a)ft( ff X J (a)s;A:V)y n x;AT 1 ( ff )s J 

3=1 
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n 



3=1 
n 



<p^(E|e j (a)| 4 E|s*ATi( (J )y n x;ATi( (J )s,| 4 ) 1 /2 = ( n -i/2 ) . 

3=1 

For the second term on the right-hand side of (3.6), we have 



E 



3=1 



3=1 



|4xl/2 



1/2 



= o(n~ 3/2 ) • (maxE|tr AjV) - Etr A _1 (tr) 
= o(n" 1 / 2 ), 

where the last step follows from applying the martingale decomposition and 
the Burkholder inequality and using the fact that 



ItrAjV) -trA- 1 ^)! < l/a 



and 



E|tr A'V) - Etr A-^cr)! 4 = 0(n 2 ). 
Thus, we conclude that 

n 

(3.7) v^KA^ V)y n - Ex;A~ 1 (a)y n ) = y/p^bEj^ + o p (l). 

3=1 

3.2. Asymptotic covariances. To compute E, by the limiting property in 
(3.7), we only need to compute the limit 



'*,3 



limp^6(CT i )6((7jOE fc _iEj fc 7 fc (t i i,t i 2,a i )E Jfc 7j fc (tj 1 ,tj2,<Ti), 



k=l 



in which, for any i, k = 1, . . . , K, we have 

7fc(tii.tt2,0i) = s^A^ 1 (o-i)x(t i 2)x*(tii)A~ 1 (cr i )s fc 

-ix^tajAfc^aOAfc^aOxCtia). 
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By Lemma 4, we obtain b(a) — > b{a) = 1/(1 + ym(a)), Thus, we only need 
to calculate 

n 

(3.8) u it j =limp^b(ai)b(a j )E k _iE k ~f k (ti 1 ,ti2,ai)E k ~f k (t jl .t j 2,(Jj). 

k=l 

For simplicity, we will use x,y, u,v, <T\ and cr 2 to denote x(tji), x(tj 2 ), 
x(tji), x(tj 2 ), o"i and cr,-. For X = (X\, . . . ,X p )' of i.i.d. entries with mean 
and variance 1, and A = (Ay) and B = (By) to be Hermitian matrices, the 
following equality holds: 

E(X*AX - tr A)(X*BX - trB) 

= tr AB + | EXf | 2 tr AB T + AuBa (E| X 1 | 4 - 2 - | EXf | 2 ) . 

Using this equality, we get 

, =lini ?M^f tr(EiA - 1(< , l)yx . A - 1(<ri) 

(3-9) 



for the complex case and obtain 
pb(a 1 )b(a 2 ) 

k=l 

x E k A^(a 2 )(vu* +uv*)A- 1 (a 2 )) 



_ Hm m^ Etr(EfcA -i ((Jl)yx * A -i (ai: 

(3.10) 



for the real case. 

One could easily calculate the limit in (3.9) by applying the method used 
in Bai, Miao and Pan (2007) and by using the proof of their equation (4.7). 
Therefore, we only need to calculate the limit of 

^ (ai ^ (a2) EtrE fc (A- 1 (a 1 ) y x*A-^a 1 ))E fc (A- 1 (a 2 )vu*A- 1 (a 2 )) 
k=i 

(3.11) 

= vKai) n b{a2) ^E fc (x*A- 1 (a 1 )A fc - 1 (a 2 )v)(u*A^(a 2 )A fc - 1 (a 1 )y), 

k=l 

where A^ 1 (z 2 ) is similarly defined as A^" 1 (a" 2 ) by using (si, . . . ,Sfc_i,8fc + i, 
. . . , s n ) and by using the fact that Sfc+i, • • • ,s n are i.i.d. copies of Sfc+i, . . . , s n . 

Following the arguments in Bai, Miao and Pan (2007), we only have to 
replace their vectors x n and x* connected with A^ 1 (a\) by y and x* and 
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replace those connected with A^ 1 (a 2 ) by v and u*, respectively. Going along 
with the same lines from their (4.7) to (4.23), we obtain 

E fc x*A^Vi)A,7V 2 )vu*A^V 2 )A,7Vi)y 
(3.12) 



l_^&( ai )&( ff2 )I t rT-V2)T-Vl) 
n n 



= x.*T- 1 (a 1 )T- 1 (a 2 )vu*T~ 1 (a 2 )T- 1 (a 1 )y 

x fl + ^5( C T 1 )6( C 7 2 )-E,._ 1 tr(A- 1 ((7 1 )A- 1 ((7 2 ))') + o p (l) 
\ n n J 

and 

F trfA- 1 ^ W- tr(T- 1 (a 1 )r- 1 ( ( T 2 ))+o P (l) 

E fc tr(A fe (a 1 )A k (a 2 )) ~ , _ {{k _ i)/ n 2) 6((7l ) 6(<T2 ) tr( T-i( ffl )r-i(a 2 )) ' 

where 

T{a)=[a+ r ^b{a)\l. 

We then obtain 

d(a 1 ,a2):=limb(a 1 )b(a 2 )-tr(T- 1 (a 1 )T~ 1 (a 2 )) 

(3.13) 

yb(oi)b(a 2 ) 



(a 1 + b(a 1 ))(a 2 + b{a 2 )) 

and 

/ l (di,cT 2 ):=6(ai)6(c7 2 )x*T- 1 ( C 7 1 )r- 1 (a 2 )vu*r- 1 ( C T 2 )r- 1 ( C Ti)y 

(3.14) 

x*vu*y&(o'l)&( cr 2) 



" (CT1+6( C T1)) 2 ( ( T 2 + 6( C T 2 )) 2 - 

From (3.13) and (3.14), we get 
The right-hand side of (3.11) 

yh(ai,a 2 ) 



o (1 -td(ai,z 2 )) J Q (l-td(a 1 ,a 2 )) 



1 - d(ai,a 2 ) 

yx*vu*yb(ai)b(a 2 ) 



(<ri + b(ai))(a 2 + 6(<t 2 ))[(o-i + 6(ai))(<r 2 + K^)) - yb(<ri)b(a 2 )] 
In addition, from (1.7), we establish 
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Applying these identities, the limit of (3.11) can be simplified to 

x*vu*yW(oi, cr 2 ), 

where 

W(ai,a 2 ) = - 7- -, — Try- -. — rr. 

1 - 2/(1 - 0im(0i))(l - a 2 m(a 2 )) 

By symmetry, the limit of (3.10) for the real case can also be simplified to 

(x* vu* y + x* uv* y)W(a 1 ,a 2 ). 

Therefore, for the complex case, the covariance function of the process Y(tn, 
tj 2 ,cr) is 

EY (ta, t»2, o-i)Y(tji,t j2 , cr 2 ) = l?(tii, t j2 )$(t i2 , tji)W(cri, cr 2 ), 
while, for the real case, it is 

EY(tii,t i2 ,a 1 )Y(t j i,t j2 ,a 2 ) 

= (^(t ll ,t j2 )i?(t il ,t i2 ) + ^(t ll ,t J i)^(t j2 ,t i2 ))^( f T 1 ,a 2 ). 

3.3. Tightness. 

Theorem 3. Under the conditions in Theorem 1, the sequence ofY n (u, a) — 
E(Y n (u,o~) is tight. 

For ease reference on the tightness, we quote a proposition from page 267 
of Loeve (1978) as follows. 

Proposition 1 (Tightness criterion). The sequence {P n } of probability 
measure is tight if and only if: 

(i) supP n (x : \x(0)\ > c) — !► as c— > oo 

n 

and, for every e > 0, as 5 — > 0, we have 

(ii) P»M*) >e) — >0, 
where 5-oscillation is defined by 

w x {8) = sup |x(t) — x(s)|. 
|t-s|<<5 

To complete the proof of the tightness for Theorem 3, we note that condi- 
tion (i) in Proposition 1 is a consequence of finite-dimensional convergence 
which has been proved in the previous section. To demonstrate condition 
(ii) in Proposition 1, we will use the two lemmas given below. Therefore, 
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to complete the proof of Theorem 3, by Proposition 1 and Lemma 1, it is 
sufficient to verify that 



(3.16) sup E 

ui,u 2 eTxT 



y„(ui)-r n (u 2 ) 



4m+2 



< OO. 



Ul,(7i) - (u 2 ,cr 2 ) 
This inequality will be proved in Lemma 2 stated below. 

Lemma 1. Suppose that X n (t) is a sequence of stochastic processes, de- 
fined on an m- dimensional time domain T, whose paths are continuous and 
Lipschitz; that is, there is a random variable R = R n such that 

\X n (t)-X n (s)\<R\\t-s\\. 

If there is an a> m such that 

(3.17) E|i?| a <oo, 
then, for any fixed e > 0, we have 

(3.18) \imP n (cj x (5) >e)=0. 

54,0 

Proof. Without loss of generality, we assume that T = [0,M] m . First, 
for any given e > and 5 > 0, we choose an integer K such that MK~ l < 5 
and 2 a K m ~ a < 1/2. For each I = 1, 2, . . . , we define 

UU,£) = J -^, j = i,...,K e . 

Denoting by t(j,£), j = (ji,...,j m ), the vector whose ith entry is ti(ji,£). 
Then, we have 

Pn > S) 



<2P sup sup |X re (t)-X re (t(j,l))|>£/2 

V j,l |t-t(j,l)|<2A/ v ^/^ 
L 

^E E 2P(|X n (t(j*,^))-X n (t(j,£ + l))|>2^ 1 £ ) 

+ 2P( sup sup \X n (t)-X n {t{j,L + l))\ >2~ L ~ 2 e 

V t(j,L+l) \\t-t(j,L+l)\\<2^/mM/K L 



<±2(K>/Mr(£S^Ym 



a 



1=1 

2 2+3a £- a m a / 2 (M/K) a - m E\R\ a 
2 2+3a e- a m a / 2 5 a - m E\R\ a ^ as S -> 0, 
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where the summation Yl(\£+i) runs over an possibilities of ji < K +1 , and 
t(j*,£) is the t(j,^) vector closest to t(j,£ + 1). Here, to prove the first 
inequality, one only needs to choose t(j,l) as the center of the first layer 
hypercube in which |(t + s) lies. The proof of the second inequality could 
be easily obtained by applying a simple induction. In the proof of the third 
inequality, the first term follows by the Chebyshev inequality and the fact 
that 

|Xn(t(jV)) - X n (t(j,£ + 1))\ < #||t(jV) - t(j,£+ 1)|| < R^xMjK 1 . 

At the same time, the second term tends to for all fixed n when L — > oo 
because 

P( sup sup |X„(t)-X n (t(j,L + l))| >2" L - 2 e) 

H(j,L+l) ||t-t(j,L+l)j|<2A/ v ^i^-- £ - 2 7 

< P(\R\ > (K/2) L+2 e/2M^h~) -»• 0. 
Thus, the proof of the lemma is complete. □ 

Lemma 2. Under the conditions of Theorem 1, the property in (3.16) 
holds for any m. 

Proof. For simplicity, we only prove the lemma for a general m instead 
of 4m + 2. For a constant L, we have 

w ^(ui.ffi) - Y n (u 2 ,a 2 ) -E(r n (ui,<Ti) - K ra (u 2 ,q 2 )) m 
||ui - U 2 || + \o~i - cr 2 1 

x n (t 1 )*A' 1 (a 1 )x ra (t 2 ) - Ex ra (ti)*A- 1 ((7i)x n (t2) 

— t3|| + ||t 2 — t4|| + |o"i — CT 2 1 

(x ra (t 3 )*A-V 2 )x n (t 4 ) -Ex n (t 3 )*A- 1 (a 2 )x w (t 4 )) m 

1 1 "t 1 — "t 3 1 1 + ||t 2 — t4|| +\(T\ — 0~2\ 

< Ln m / 2 {E|((x n (t 1 ) - x„(t 3 ))*A- 1 (a 1 )x n (t 2 ) 

- E(x n (tx) - X n (t 3 ))*A- 1 ( C T 1 )x rl (t 2 ))(||t 1 - tgll)" 1 !" 1 

+ E|(x;(t 3 )A- 1 (fTi)(x n (t 2 )-x n ,(t4)) 

-Ex;(t 3 )A- 1 (a 1 )(x n (t 2 )-x„(t4)))(||t 2 -t 4 ||)^T 

+ E|x;(t 3 )A^ 1 (a 1 )A^ 1 ( C r 2 )x n (t4) 

- Ex^A-y^A- V 2 )x n (t 4 )n, 

where a >c b means a and b have the same order, that is, there exists a 
positive constant K such that K~ l b < a < Kb. 
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We note that ||x n (ti) — x n (t3)||/||ti — ±^3 1 1 < 1 or bounded for the general 
case. By applying the martingale decomposition in (3.1), the Burkholder 
inequality and the estimates in (3.5), we have 

(x n (ti) - x n (t 3 ))* A- 1 ((7i)x n (t 2 ) - E(x n (ti) - x n (t 3 ))*A- 1 (ai)x n (t 2 ) 



V 2 E 



l|ti-t 3 || 
= 0(1). 
Similarly, we obtain 



,/2 E 



x* (t 3 )A- 1 (^i)(x ri (t 2 ) - x n (t 4 )) - Ex* (t 3 )A- 1 ( C j 1 )(x n (t 2 ) - x n (t 4 )) 



l|t 2 -t 4 || 
= 0(1). 

Using the martingale decomposition and the Burkholder inequality, we get 
n m / 2 E| X ;(t 3 )A- 1 ( ( r 1 )A- 1 ( ( T 2 )x n (t4)-Ex;(t 2 )A- 1 ( C T 1 )A- 1 ( C T 2 )x n (t4)r 



< Ln m l 2 ]TE|x;(t 3 )[A- Vx)A- V 2 ) - A^ 1 (a 1 )A fe - 1 (^)]x n (t 4 ) 

Jc=l 

+ E(^E fc _ 1 | X ;(t 3 )[A- 1 (a 1 )A- 1 (a 2 ) 
\fc=i 

\ m/2- 

- A~ k 1 (a 1 )Al\a 2 )]^ l {U)\ 2 \ 
= 0(1), 

which follows from applying the following decomposition: 
A- 1 (cT 1 )A- 1 (^)-A- 1 (a 1 )A- 1 (cr 2 ) 

= /3(a 1 )A^ 1 (a 1 )s (t s^A^ 1 (a 1 )A,T 1 (a 2 ) + /3(a 2 )A,T 1 (^)A^ 1 (a 2 )s (fc s^A^ 1 ( C T 2 ) 

and thereafter employing the results in (3.5). Thus, condition (3.16) is ver- 
ified. □ 

4. Proof of Corollary 2. Applying the quadratic equation (1.7), we have 
(4.1) a = - 



m 1 + ym 
Making a difference of a\ and cr 2 , we obtain 

m(cr 2 ) -m(cri) y(m(a 2 ) - m(ai)) 



o\ - a 2 



m(ai)m(a 2 ) (1 + ym(ai))(l + ym(a 2 )) ' 
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We also establish 

m(a 2 ) - m(ai) m{ai)m{a 2 ){\ + ym(o~i)){l + ym(a 2 )) 



(4.2) 

cji - 02 (1 + j/m(<7i))(l + ym(a 2 )) - ym(a 1 )m(a 2 ) 

Finally, we conclude that 
m(cr 2 ) - m(oi) 



m(ai)m(a 2 ) 

W{ai,a 2 ) 



<J\ - cr 2 

ym 2 (ai)m 2 (a 2 ) 



(1 + ym(<7i))(l + ym{a 2 )) - ym{ai)m{a 2 ) 

by noticing that 1 + ym{a) = m(o")/(l — o~m(o~)), an easy consequence of 
(4.1). 

Furthermore, one could easily show that the left-hand side of the above 
equation is 

dF y (x) f b dF y (x) f b dF y (x) 



(x + ai)(x + a 2 ) J a x + aiJ a x + a 2 ' 
By using the unique extension of analytic functions, we have 

W(ZUZ2)= t iF f> f^JM ram. 

Ja {X-Zi){x-Z 2 ) J a X-Zl J a X-Z 2 

Substituting this into Corollary 1, we complete the proof of Corollary 2. 

APPENDIX 

Lemma 3 [Theorem 35.12 of Billingsley (1995)]. Suppose that, for each 
n, X n x,X ni 2, ■ ■ ■ ,X n rn is a real martingale difference sequence with respect 
to the increasing a-field {J- n ,j} having second moments. If, as n—> oo, 



(i) ^K;^./;., ,) ^ and 
(") E E ^ J (l^l> £ ))^0, 

3=1 

where a 2 is a positive constant and e is an arbitrary positive number, then 

^X n ^N(0,a 2 ). 

3=1 

In what follows, Sj, A -1 and Aj 1 are defined in Section 3 and Mj and 
M refer to any pair of matrices which are independent of s,-. 
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Lemma 4. Under the conditions of Theorem 1, for any matrix Mj bounded 
in norm and independent of Sj , we have 



(A.l) max 

3 



1 {s*MjSj tr Mj ) 



The proof of this lemma could be easily obtained by applying the trun- 
cation technique and invoking Lemma 2.7 of Bai and Silverstein (1998). 

Lemma 5. Under the conditions of Theorem 1, for any x n ,y n € C^, 
(A.2) sup KA^My,, - x^A^My^ ^ 0. 

3 

Similarly, for any matrix M with bounded norm and independent of s, , we 
have 

(A.3) m a x\E jX * n Aj l (a)My n - E i x;A^ 1 (a)My n | ^> 0. 

Proof. Using 
(A.4) A- 1 (a) = Aj 1 (a)-Aj 1 (a)s :i s*A- 1 (a)(3 J (a), 

we obtain 

sup |x* A _1 My n - x* A^My^ < supH^Aj^SjA^Mynl 

3 3 n 

= sup — |x* At 1 AT 1 My n | + o(l), 

j n 3 J 

which, in turn, implies (A.2). Here, we adopt (A.l) in the last step above. 
The conclusion (A.3) can be proved in a similar way. □ 

Lemma 6. Under the conditions of Theorem 1, for any x n ,y n £ C^, we 
have 

(A.5) x;A- 1 (a)y n - <y„m(ff) ^ 
and 

(A.6) max|E,-x^AT 1 ( C r)y n - x£y n m(cr)| 0. 

3 J 

Proof. By using the formula A = ^j=i s j s j + a ^ anc ^ multiplying x* 
from the left- and multiplying A _1 y n from the right-hand side of the equa- 
tion, we obtain 

1 n 

< A ~ V)yn = ^ -1 x;y n Vx;s i s*AT 1 ( f7 )/3 i (cj)y n , 

no *■ — ' J 

3=1 
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As (3j(a) -^-4 b = 1+ y m ^ uniformly in j, we apply Lemmas 4 and 5 and 
obtain 

x;A" 1 (cr)y n = cr _1 x*y„ - c7~ 1 x;A~ 1 (cj)y n 6(cr) + o(l). 
This, in turn, implies that 

cr + 6(a) 

The conclusion in (A. 5) could then follow from the fact that 

a + b(a) 

whereas the conclusion in (A. 6) can be proved by employing the same 
method. □ 

Lemma 7. Under the conditions of Theorem 1, for any x n ,y„ £ C^, we 
have 

v / n(x*E(S n + al)~ 1 y n - x* n y n m n (a)) — > 0. 

Proof. When y n — x n , Lemma 7 in our paper reduces to the conclusion 
(5.5) —> as shown in Bai, Miao and Pan (2007). To complete the proof, one 
could simply keep x* unchanged and substitute x n by y n = (x* y n )x n + z n 
in the proof of the above conclusion. Thereafter, the proof of this lemma 
follows. □ 
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